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INTRODUCTION: 

The  central  hypothesis  of  this  project  was  that  the  formation  of  a  tetrad:heptad  variant  of 
a  G-quadruplex  by  the  HER-2/neu  promoter  represses  HER-2/neu  transcription.  The 
tetrad:heptad  was  expected  to  provide  a  target  for  selective  drug-DNA  interaction  by  G- 
quadruplex  interactive  compounds  and  inducing  or  stabilizing  the  tetrad:heptad  in  the 
HER-2/neu  promoter  with  a  G-quadruplex  interactive  compound  will  silence  HER-2/neu 
expression. 

The  specific  aims  of  this  proposal  were: 

Specific  Aim  1:  Investigation  of  the  potential  biological  role  of  a  tetrad:heptad  DNA 
structure  in  the  HER-2/neu  promoter  in  the  regulation  of  HER-2/neu  gene  expression. 

Specific  Aim  2:  Investigation  of  the  ability  of  quadruplex  selective  DNA  interactive 
compounds  to  target  the  tetrad:heptad  structure  in  the  HER-2/neu  promoter  in  vitro  and  in 
vivo. 

In  order  to  accomplish  these  aims,  we  proposed  a  series  of  experiments  to  evaluate  the 
ability  of  G-quadruplex  interactive  compounds  to  bind  to  the  HER-2/neu  promoter  in 
solution  and  prevent  HER-2/neu  expression  in  cells.  We  proposed  to  identify  lead 
compounds  and  perform  biochemical  studies  to  further  characterize  the  structure  formed 
by  the  HER-2/neu  promoter  in  an  attempt  to  understand  the  potential  biological  role  of 
DNA  secondary  structure  in  the  regulation  of  HER-2/neu  expression. 

In  this  final  annual  report,  we  describe  the  development  of  a  tool  for  studying  the  HER- 
2/neu  promoter  with  G-quadruplex  interactive  compounds  or  other  compounds  that  may 
target  the  HER-/neu  promoter  or  its  GGA  repeat  region,  and  the  evaluation  of  one  G- 
quadruplex  interactive  lead  compound  in  this  model  system. 

The  Ebbinghaus  laboratory  also  published  two  related  papers  on  the  potential  role  of  G- 
quadruplexes  in  the  promoters  of  two  other  important  cancer  related  genes — HIE-1  alpha 
and  c-MYB — that  were  related  to  the  proposed  work  in  the  HER-2/neu  promoter.  These 
tangible  work  products  can  be  attributed  to  this  Army  contract. 

BODY: 

A.  HER-2/neu  promoter  studies.  The  Ebbinghaus  laboratory  constructed  a  model  system 
for  testing  the  role  of  GGA  repeat  region  in  controlling  the  expression  of  the  HER-neu 
promoter.  This  model  system  was  especially  well  suited  to  testing  whether  a  G- 
quadruplex  interactive  compound  that  inhibited  HER-2/neu  expression  was  binding  to  a 
G-quadruplex  formed  by  the  HER-2/neu  GGA  repeat  region. 

In  order  to  create  completely  isogenic  reporter  cell  lines  bearing  a  single,  integrated 
HER-2/neu  promoter-luciferase  reporter  gene,  we  transferred  the  native  or  mutant  HER- 
2/neu  promoters  together  with  the  entire  luciferase  coding  sequence  from 
pGE4.20/HNP386  (described  in  last  year's  annual  report)  into  the  pcDNA5/ERT  plasmid 
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(Invitrogen),  the  latter  modified  by  entirely  removing  the  CMV  promoter.  The  resulting 
vectors  are  named  pHNP386-luc/FRT  and  pHNP386(A-225)-luc/FRT. 

These  vectors  were  used  to  create  a  matched  pair  of  stable,  integrated  genes  in  host  cells 
bearing  a  fip  recombinase  target  sequence  (FRT)  by  using  the  fip  recombinase  enzyme.  A 
host  cell  line  derived  from  293  cells  was  commercially  available  from  Invitrogen  for  this 
purpose,  and  we  thus  constructed  a  matched  pair  of  293  cells  bearing  integrated  HER- 
2/neu  promoters  upstream  of  the  luciferase  gene  from  the  pGF4  series  of  vectors 
(Promega).  These  vectors  differed  only  in  a  6  bp  mutation  to  the  GGA  repeat  region  of 
the  HER-2/neu  promoter: 

Wild  type  (GGA  repeat  region):  tcacaggagaaggaggaggtggaggaggagggctgc 

A-225  mutation:  tcacaggagaaggaggaggtggaggcatgcggctgc 


Because  the  HER-2/neu  promoter-reporter  fragment  is  genomically  integrated  and  in 
single  copy,  these  cell  lines  could  be  directly  compared  for  differences  in  luciferase 
activity  as  a  reporter  for  HER-2/neu  gene  expression.  These  cells  could  then  be  treated 
with  any  compound  that  might  target  the  HER-2/neu  promoter,  and  luciferase  activity 
would  be  a  measurement  of  the  effect  of  the  gene  on  the  target.  Since  the  putative  G- 
quadruplex  depends  on  an  intact  GGA  repeat  sequence,  a  comparison  of  the  effect  of  a 
drug  on  native  versus  mutated  HER-2/neu  promoter-reporter  cells  would  indicate  whether 
the  drug  could  reasonably  be  assumed  to  bind  to  the  GGA  repeat  region.  A  CMV- 
luciferase  promoter-reporter  contruct  was  also  generated  and  genomically  integrated  at 
the  same  site  in  another  cell  line  to  serve  as  an  additional  control. 

A  G-quadruplex  interactive  lead  compound,  CX-1398  from  Cylene  (and  discussed  in 
prior  annual  reports)  was  applied  to  the  two  matched  cell  lines,  as  well  as  a  third  cell  line 
bearing  the  CMV  promoter-luciferase  contruct.  10-fold  serial  dilutions  of  CX-1398  were 
applied  to  the  cells  for  24  hours,  and  the  cells  were  lysed,  total  protein  measured,  and 
luciferase  activity  measured  with  a  luminometer.  Fuciferase  activity  was  adjusted  for 
total  protein  content  of  the  lysate  and  normalized  against  the  "no  treatment"  control  cell 
lysate.  The  results  of  this  experiment  are  shown  in  Figure  1  below. 
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Figure  1,  Treatment  of  HER-2/neu  reporter  cells  bearing  wild-type  (Wt)  or  mutant  (Mut) 
HER-2/neu  promoter-luciferase  constructs  with  the  G-quadruplex  interactive  compound 
from  Cylene,  CX-1398.  A  CMV  promoter-luciferase  contruct  was  treated  as  an  additional 
control. 

These  studies  showed  that  CX-1398  only  reduced  FIER-2/neu  driven  luciferase 
expression  compared  to  vehicle  (DMSO)  control  at  high  doses  (100  uM).  There  was  no 
differential  effect  of  CX-1398  on  mutant  versus  wild-type  HER-2/neu  promoters. 

We  conclude  that  the  model  G-quadruplex  interactive  compound  CX-1398  does  not 
inhibit  HER-2/neu  by  binding  to  a  G-quadruplex  in  the  promoter  region  of  this  oncogene. 
However,  we  do  believe  that  these  data  provide  a  useful  model  for  future  promoter 
studies  that  could  be  applied  to  the  study  of  other  genes  bearing  putative  G-quadruplex 
forming  sequences.  Eurthermore,  these  reporter  constructs  and  cell  lines  could  be  used 
directly  to  screen  for  compounds  that  might  interact  with  the  HER-2/neu  promoter,  in  a 
manner  analogous  to  the  previous  report  by  Scott  and  colleagues  (Mol  Cancer  Ther.  2002 
Apr;l(6):385-92). 

B.  c-MYB  G-quadruplex  promoter  studies.  The  Ebbinghaus  lab  identified  the  c-MYB 
gene  as  bearing  a  GGA  repeat  region  that  was  homologous  to  the  one  in  the  HER-2/neu 
promoter.  Indeed,  because  the  c-MYB  gene  contained  three  (GGA)4  repeats,  it  was  likely 
more  amenable  to  the  formation  of  a  stable  G-quadruplex  structure,  potentially  forming  a 
tetrad:heptad:heptad:tetrad  structure. 

The  laboratory  embarked  upon  studies  of  the  c-MYB  promoter  from  which  we  concluded 
that  Our  findings  show  that  the  G-quadruplex-forming  region  in  the  c-myb  promoter  is  a 
critical  cis-acting  element  that  may  repress  c-myb  promoter  activity  through  MAZ 
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binding  to  the  G-quadruplex  structure  (Nucleic  Acids  Res.  2008  Apr  1;36(6):  1755- 1769). 
These  data  are  among  the  first  to  show  that  a  known  transcription  factor  can  potentially 
bind  to  alternate  conformations  of  the  same  promoter,  and  provide  further  evidence  in 
support  of  the  role  of  G-quadruplexes  in  the  control  of  gene  expression. 

KEY  RESEARCH  ACCOMPEISHMENTS: 

Pre-award  through  End  of  Year  1 

o  Identification  of  a  GGA  repeat  region  of  the  HER-2/neu  promoter  capable  of 
forming  an  unusual  secondary  structure,  a  tetrad-heptad,  related  to  G-quadruplex. 
o  Biochemical  characterization  of  secondary  structure  formation  using 
oligonucleotides  representing  the  HER-2/neu  promoter. 

o  Identification  of  two  classes  of  compounds  (telomestatin,  a  natural  product,  and  a 
proprietary  fluoroquinolone  from  Cylene  Pharmaceuticals)  capable  of  binding  to 
the  tetradiheptad  structure  in  the  HER-2/neu  promoter  using  a  cell-free 
biochemical  assay  (DNA  polymerase  arrest  assay) 

o  Preliminary  demonstration  that  compounds  identified  in  the  cell-free  assay 
can  reduce  HER-2/neu  transcription  ex  vivo  (in  cell  culture) 

Year  2 

o  Construction  of  reporter  plasmids  bearing  the  HER-2/neu  promoter  and  specific 
mutations  that  disrupt  secondary  structure  formation  in  the  HER-2/neu  promoter, 
o  Transient  transfection  analysis  of  native  and  mutant  HER-2/neu  reporter 
plasmids. 

o  Identification  of  a  G-quadruplex  forming  region  of  the  HIE- la  promoter. 

o  Biochemical  characterization  of  secondary  structure  formation  using 
oligonucleotides  representing  the  HIE- la  promoter, 
o  Identification  of  two  classes  of  compounds  (telomestatin,  a  natural  product, 
and  TmPyP4,  a  cationic  porphyrin)  that  can  bind  to  the  G-quadruplex 
formed  by  the  HIE- la  promoter. 

Year  3 

o  Construction  of  stable,  single  copy,  integrated  reporter  cell  lines  bearing  the  HER- 
2/neu  promoter  and  specific  mutations  that  disrupt  secondary  structure  formation 
in  the  HER-2/neu  promoter. 

o  Evaluation  of  CX-1398,  a  proprietary  compound  from  Cylene,  identified 
as  a  lead  G-quadruplex  interactive  compound  potentially  capable  of 
binding  to  the  putative  tetradiheptad  structure  in  the  HER-2/neu  promoter 
during  year  1 . 

o  Identification  of  a  G-quadruplex  forming  region  of  the  c-MYB  promoter. 

o  Biochemical  characterization  of  secondary  structure  formation  using 
oligonucleotides  representing  the  c-MYB  promoter, 
o  Identification  of  MAZ  as  a  transcription  factor  that  can  recognize  alternate 
DNA  conformations  in  the  c-MYB  promoter. 
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2004  (Abstract  3863). 

4.  Memmott  R,  Basye  J,  Hurley  L,  Ebbinghaus  SW.  Eormation  of  a  novel  G- 
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the  Hypoxia  Inducible  Eactor  la  Promoter.  Biochemistry  44(49):  16341-50,  2005. 

2.  Palumbo  SL,  Memmott  RM,  Uribe  DJ,  Krotova-Khan  Y,  Hurley  EH,  Ebbinghaus 
SW.  A  novel  G-quadruplex-forming  GGA  repeat  region  in  the  c-myb  promoter  is 
a  critical  regulator  of  promoter  activity.  Nucleic  Acids  Res,  2008  Apr 
1;36(6):1755-1769. 

CONCLUSION: 

Although  we  identified  two  classes  of  compounds  that  appeared  to  have  some  effect  on 
HER-2/neu  expression  and  were  also  capable  of  binding  to  G-quadruplex  DNA 
simulating  the  putative  G-quadruplex  structure  in  the  HER-2/neu  gene,  we  were  unable  to 
convincingly  demonstrate  that  a  putative  G-quadruplex  structure  was  likely  to  be  present 
in  the  HER-2/neu  promoter  because  of  ambiguity  in  the  outcome  of  the  biochemical 
(cell-free)  and  in  vitro  studies  in  cell  lines. 

In  related  work  attributable  to  the  Army  contract,  we  more  convincingly  demonstrated 
the  presence  of  G-quadruplex  structures  in  critical  cis-elements  of  the  HIE-1  alpha  and  c- 
MYB  promoters,  leading  to  publications  in  Biochemistry  and  Nucleic  Acids  Research. 

APPENDIX: 

Reprint  of  Nucleic  Acids  Res,  2008  Apr  1;36(6):  1755- 1769. 
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ABSTRACT 

The  c-myb  promoter  contains  multiple  GGA  repeats 
beginning  17  bp  downstream  of  the  transcription 
initiation  site.  GGA  repeats  have  been  previously 
shown  to  form  unusual  DNA  structures  in  solution. 
Results  from  chemical  footprinting,  circular  dichro- 
ism  and  RNA  and  DNA  polymerase  arrest  assays  on 
oligonucleotides  representing  the  GGA  repeat  region 
of  the  c-myb  promoter  demonstrate  that  the  element 
is  able  to  form  tetrad:heptad:heptad:tetrad  (T:H:H:T) 
G-quadruplex  structures  by  stacking  two  tetrad:hep- 
tad  G-quadruplexes  formed  by  two  of  the  three 
(GGA)4  repeats.  Deletion  of  one  or  two  (GGA)4 
motifs  destabilizes  this  secondary  structure  and 
increases  c-myb  promoter  activity,  indicating  that 
the  G-quadruplexes  formed  in  the  c-myb  GGA  repeat 
region  may  act  as  a  negative  regulator  of  the  c-myb 
promoter.  Complete  deletion  of  the  c-myb  GGA 
repeat  region  abolishes  c-myb  promoter  activity, 
indicating  dual  roles  of  the  c-myb  GGA  repeat 
element  as  both  a  transcriptional  repressor  and  an 
activator.  Furthermore,  we  demonstrated  that 
Myc-associated  zinc  finger  protein  (MAZ)  represses 
c-myb  promoter  activity  and  binds  to  the  c-myb 
T:H:H:T  G-quadruplexes.  Our  findings  show  that  the 
T:H:H:T  G-quadruplex-forming  region  in  the  c-myb 
promoter  is  a  critical  c/s-acting  element  and  may 
repress  c-myb  promoter  activity  through  MAZ  inter¬ 
action  with  G-quadruplexes  in  the  c-myb  promoter. 


INTRODUCTION 

The  c-niyb  proto-oncogene,  the  cellular  hoinologue  of  the 
transforming  v-myb  oncogene  of  avian  leukemia  viruses. 


encodes  a  critical  transcription  factor  for  proliferation, 
differentiation  and  survival  of  haematopoietic  progenitor 
cells  (T).  High  levels  of  the  gene  product  c-Myb  prevent 
haematopoietic  stem  cells  from  both  differentiation  and 
apoptosis  (2,3).  Because  of  the  critical  role  of  c-Myb  m 
determining  cell  fate,  c-Myb  e.xpression  levels  are  tightly 
controlled  in  normal  cells,  showing  high  levels  in 
immature,  proliferating  haematopoietic  cells  and  unde¬ 
tectable  levels  in  differentiated  cells  (4).  c-Myb  is  over 
expressed  in  many  leukemias  and  some  solid  tumors,  and 
plaj's  a  critical  role  in  letikemogenesis  by  maintaining  cells 
in  a  proliferative  stale  and  by  preventing  terminal 
differentiation  (5-7).  huger  and  coworkers  showed  that 
an  antisense  oligonucleotide  (ODN)  against  c-myb 
mRNA  could  eliminate  leukemia  cells  as  a  bone  marrow 
purging  agent,  resulting  in  cytogenetic  remissions  in  CML 
and  showing  that  c-myb  is  a  potential  therapeutic  target 
for  leukemia  treatment  (8). 

Cellular  levels  of  c-myb  are  regulated  at  the  transcrip¬ 
tional  level  by  several  mechanisms.  One  important 
mechanism  is  blocking  mR.NA  elongation  in  intron  I, 
which  attenuates  c-myb  mRN.A  elongation  in  a  cell  type- 
dependent  manner  (9-  1 2).  c-Myb  transcription  is  also 
controlled  by  a  number  of  transcription  factors.  For 
example. ’c-lvlyb  acts  as  a  negative  regulator  of  its  own 
expression  in  a  lineage-dependent  manner  (1 3).  WTl 
(Wilms  Tumor  I),  MZFT  (myeloid  zinc  finger  I)  and  PU.l 
also  down  regulate  c-myb  promoter  activity,  and  the  Ets 
and  c-.Iun/JuiiD  transcription  factors  activate  the  c-myb 
promoter  in  a  lineage-specific  manner  (Figure  I  A)  (1 4-  1 8). 

The  c-myb  promoter  contains  a  purine-rich  region  with 
three  copies  of  four  GGA  repeats.  [3(GGA)4].  located  17 
bps  downstream  of  the  transcription  initiation  site  on 
the  bottom  strand,  and  the  GGA  repeat  region  is  highly 
conserved  in  mammals.  Nuclear  magnetic  resonance 
(NMR)  and  molecular  modeling  studies  showed  that 
(GGAlj  DNA  sequences  similar  to  those  found  in  the 
human  c-myb  promoter  can  form  an  unusual  .secondary 
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5'  TCCTCCTCCTCCTTCTCCTCCTCCTCCGTGACCTCCTCCTCC 
•  3  -AGGAGGAGGAGGAAGAGGAGGAGGAGGCACTGGAGGAGGAGG 


Region  3  Region  2  Region  1 

Murine:  3'  -AGG AGGAGG AGGAAGAGGAGG  . AAGAGGAGGAGGAGGAG 

Rat:  3'  -AGGAGGAGGAGGAAGAGGAGG  . AAGAGGAGGAGGAGGAG 

Dog:  3'  -AGGAGGAGGAGGAAGAGGAGGAGGAGGCGCTGGAGGAGGAGG 
Domestic:  3'  -AGGAGGAGG AGGAAGAGGAGGAGGAGGCGCTGGAGGAGGAGG 
Cow 


Figure  1.  The  c-myb  promoter  and  key  transcription  factors  that  reguiato  c-niyb  expression  and  G'qnadnipicKcs  formed  by  2(GGA)4  or 

(GGA)s  (19-22).  (A)  Transcription  initiation  site  (-^1)  is  indicated  with  :in  arrow,  inliihitors:  c-Myb  (—596- — 618).  WT!  (—390  -'  -428).  myeloid 
zinc  linger  1  (MZi'-l.  — 50  -- -64  and  -162'- -178)  and  PV.]  binding  site  (  a- 1 5 --  -  20).  Activators:  bts  (-25 1  -270)  and  c-jiin,JunD 
(-138-- -162).  Putative  transcription  factors  that  bind  to  the  GGA  repeat  region  include  MAZ,  RXR  and  VDR.  winch  have  multiple  binding  sites 
in  the  GGA  repeat  region.  The  myb  GGA  repeat  sequences  in  diflerent  species  are  compared,  and  RI  and  R3  are  most  conserved  in  the  five  different 
species.  (B)  Lefi:  (GG.A)4  forms  a  T:H  and  tvv('  T;Hs  imcrmolccuiarly  dimerize  (o  form  a  T:H:H;T  G-qiiadruplex  (taken  from  reference  19).  Rlehi: 
(GGA)s  forms  two  T;Hs  that  dimerize  intranioleciilarly.  resulting  in  a  T:H:H:T  (j-quadruplex  (modified  from  reference  21). 


DNA  struclure  related  to  guanine  quadruplexes 
and  coiTipo,sed  of  a  guanine  tetrad  (T)  stacked  onto  a 
guanine-adenine  heptad  (H)  (Figure  IB)  (19,20).  This 
unique  G-quadruplex  was  the  first  identified 
G-qiiadruplex  structure  with  pseudo-double  chain  reversal 
loops  in  which  the  loop  base  (A)  is  pari  of  the  heptad, 
DN.A  sequences  with  Ivvo  adjacent  (GG,X)4  units. 
(GGA)x.  can  form  a  very  stable  (T,,,  =  86 'C)  liigher 
ordered  structure  by  intramoleculaiiy  stacking  two  T:H 
G-quadruplexes  on  the  heptad  plane,  resulting  in  a 
telrad;heptad:heptad:tetrad  (T:H:H:T)  DNA  structure 
(Figure  IB)  (21.22),  These  structures  form  in  the  presence 
of  physiologic  concentrations  of  potassium  ions  and  at 
neutral  pH, 

GGA  repeats  found  in  some  promoters  are  shown  to  be 
a  critical  cis  element  for  proteimDNA  interaction.  The 
eight  GGA  repeats  (GG.A)^  found  in  the  avian  SPARC 
promoter  were  shown  to  be  a  critical  positive  regulatory- 
region  with  multiple  Spl  Sp3  binding  sites  (23),  In 
addition,  the  human  homologue  of  SP.ARC,  the 
gene,  also  contains  GGA  repeats  ;tnd  tlie  region  is 
required  to  exert  promoter  activity  of  the  BM-40  gene 
(24).  These  studies  show  that  GG.A  repeats  in 
gene  promoters  can  play  critical  roles  in  regulating  gene 


expression,  and  suggest  that  the  formation  of  secondary 
DNA  structures  may  be  important  to  the  function  of  cis- 
acting  GGA  repeals. 

In  the  present  study  we  investigated  if  the  c-myb  GG.A 
repeat  region  can  form  T:H:H:T  G-quadruplexes  and  if  the 
region  is  a  critical  regulator  of  promoter  activity.  We  report 
that  the  e’myb  GG.A  repeat  region  forms  T:H:H:T 
G-quadruplexes  involving  two  (GGA)4  regions  and  acts 
as  ;i  I'epressor  element  of  the  c-myb  promoter.  However,  the 
GGA  repeat  region  is  also  essential  for  basal  promoter 
activity  of  the  c-myb  gene.  We  identified  a  transcription 
factor,  Myc-assodaied  zinc  finger  protein  (MAZ).  that 
binds  to  the  double-stranded  and  G-qiiadruplex  conforma¬ 
tions  of  the  GGA  repeat  region  and  represses  c-myb 
promoter  activity.  Our  findings  suggest  that  the  c-myb 
G-quadruplexes  act  as  a  negative  regulator  of  the  promoter 
and  that  M’.AZ  may  repress  c-myb  promoter  activity  by- 
binding  to  the  c-n-iyb  T:H:H:T  G-quadruplexes. 

MATERIALS  AND  METHODS 

Oligomicleotidcs 

The  oligonucleotide  (ODN)  containing  the  pyrazolopyn- 
midine  analog  of  guanine  (PPG)  was  synthesized  by 
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Tabic  1.  Summary  of  ODNs  used  in  this  siiuiy 


DV1S  proicction 

CD  spectra 

EMSA 

DNa\  poi  arrest 

RNA  po!  arrest 

c-Myb  GGA  repeat  region  Wildtypc 

1 

XI 

XXI 

R1  mut 

n 

XII 

R2  nnit 

III 

Xlll 

R.t  unit 

IV 

XIV 

(GG.A)s  control 

V 

V 

XVII 

R1  and  2 

VI 

VI 

PPG  Rl  and  2 

VII 

VII 

R  1  duplc.x 

VIII 

R2  ditplc.x 

IX 

R.t  duplc.x 

X 

R2  tind  ,t 

XV 

XXV 

R.t 

XVI 

XXVI 

F-or  sequence  dclails,  see  Supplementary  Data. 


Bio-Synthesis,  Inc.  (Lewisville,  TX.  USA).  All  other 
ODNs  were  synthesized  by  Operon  Biotechnologies,  Inc. 
(Huntsville,  AL,  USA).  ODNs  were  gel  purified  and  the 
concentrations  were  determined  by  using  a  spectropho¬ 
tometer.  A  summary  and  sequences  of  the  ODNs  are 
shown  in  Table  1  and  Supplementary  Data,  respectively. 

Circular  dichroisni  spectroscopy 

ODNs  for  circular  dichroism  (CD)  were  prepared  at  5  or 
1,25  pM  in  50  mM  Tris-HCl  (pH  7,5)  in  the  presence  or 
absence  of  140mM  KCl.  Samples  were  denatured  at  95''C 
and  slow'ly  cooled  to  room  temperature.  CD  spectra  were 
measured  by  a  Jasco-SlO  spectropolarimeter  (.lasco, 
Easton,  MD,  USA)  using  a  quartz  cell  of  I -mm  optical 
path  length,  an  instrument  scanning  speed  of  100  nm,  min, 
with  a  response  of  I  s,  over  a  range  of  200-  350  nm.  A  set 
of  three  scans  was  averaged  for  each  sample  at  25"C. 

Dimethylsulfatc  protection  assay 

ODNs  were  5'-end  labeled,  diluted  to  lOOnM  in  40  pi  of 
lOmM  Tris-HCl  (pH  8.0),  denatured  at  95  C,  and  slowly 
cooled  down  to  37  C  in  tlie  presence  or  absence  of 
140mM  KCl.  .An  amount  of  1  pg  of  dldC  wais  added  to 
each  sample,  and  the  sample  w-’as  treated  with  0.5% 
diniethylsulfate  (DMS)  as  a  final  concentration  for  5  min 
at  RT.  The  reaction  was  quenched  by  adding  50  pi  of  a 
stop  solution  (1.5  M  sodium  acetate  (pH  7.0),  IM 
p-mercaptoethanol.  250pg/ml  yeast  tRNA),  and  the 
DMS-treated  ODNs  were  cleaved  by  I0%(v/v)  piperidine, 
and  separated  on  a  10%  sequencing  gel. 

Polymerase  arrest  assays 

DNA-polymerase  arrest  assays  w’ere  performed  as  pre¬ 
viously  described  (25).  Briefly,  a  primer  was  annealed  to 
the  template  bearing  the  sequence  of  interest  and  gel 
purified.  The  asymmetric  primer-template  duplex  w'as 
incubated  in  the  absence  or  presence  of  140  mM  KCl  at 
37"C  overnight,  and  primer  extension  was  conducted  with 
Taq  polymerase  (Fermentas,  Hanover,  MD,  USA)  at 
47“C  for  20  min. 

RN,A  polymerase  arrest  assays  w’ere  conducted  by  using 
the  ill  vitro  transcription  kit  T7  MEGAshortscript  from 


/Vmbion  (.Austin,  TX,  US.4)  according  to  the  manufac¬ 
turer's  protocol.  The  top  sequence  of  T7  promoter  was 
annealed  to  template  ODNs  bearing  the  bottom  sequence 
of  T7  promoter  followed  by  the  c-myb  GGA  repeat 
region.  In  vitro  transcription  from  the  T7  promoter  site 
was  conducted  in  the  presence  of  rNTPs  and  alpha 
UTP  for  2h  at  37  C.  The  reaction  was  stopped  by- 
digesting  DN.A  templates  with  DNase  I,  and  the  transcrip¬ 
tion  products  were  separated  on  a  12%i  denaturing  gel. 

Cell  lines 

The  T-cell  acute  lymphoblastic  leukemia  cell  line  CCRF- 
CEM  was  purchased  from  ATCC  (Manassas,  VA,  USA): 
the  chronic  myelogenous  leukemia  (CML)  cell  line  K562 
was  purchased  from  Coriell  Cell  Repositories  (Camden, 
NJ.  LhSA);  and  the  genetically  modified  human  embryonic 
kidney  cell  line  Flp-In^^’  293  w-as  purchased  from 
Invitrogen  (Carlsbad.  CA,  US.4).  Cell  lines  were  main¬ 
tained  as  recommended  by  the  suppliers. 

Site-directed  mutagenesis 

The  wild-type  c-myb  reporter  plasmid  pMybWT  was 
constructed  by  subcloning  the  900  bp  c-myb  promoter  in 
the  plasmid  LB  178  into  a  pGL3  basic  vector  (Promega, 
Madison,'  WI,  USA).  .411  the  mutant  c-myb  reporter 
plasmids,  except  for  pMybDelR2,  were  generated  by  using 
the  QuikChange  XL  Site-Directed  Mutagenesis  Kit 
(Stratagene,  La  .lolla,  CA,  USA)  according  to  the 
manufacturer’s  protocol.  For  pMybDelR2,  mutagenic 
PCR  amplified  the  reporter  plasmid  pMybWT  excluding 
Region  2  GGA  repeats  in  the  promoter,  and  then  the  PCR 
product  was  ligated  m  order  to  circularize  it.  The  c-myb 
GGA  repeat  region  and  adjacent  flanking  region  in  the 
reporter  plasmids  were  sequenced  in  order  to  confirm  that 
there  were  no  unintended  mutations  introduced  during 
mutagenesis. 

Generation  of  Flp-In  293  MybVVT  lucifera.se  reporter 
stable  cells 

The  c-myb  reporter  plasmid  pFRT'MybWT/Luc  for 
stable  transfection  was  generated  by  cloning  the  BgllL 
Xbai  fragment  containing  the  wild-type  myb  promoter 
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and  liiciferase  coding  sequence  from  the  pMybWT 
construct  at  the  Bglll  and  Spel  sites  of  a  pcDNA5/FRT 
vector,  in  which  the  CMV  promoter  of  the  vector  has  been 
deleted  (Invitrogen,  Carlsbad,  CA,  USA).  The  sequence  of 
the  insert  was  confirmed  by  sequencing. 

The  Flp-In  293  MybWT  reporter  stable  cell  line  was 
generated  by  following  the  manufacturer's  protocol. 
Briefly,  the  pFRT/MybWT  Luc  construct  and  pOG44at 
a  ratio  of  1:9  w-ere  transfected  into  the  Flp-in  293  cells 
(Invitrogen,  Carlsbad,  CA,  LJSAj  by  using  ExGenSOO 
(Fermentas,  Glen  Burnie,  MD,  USA)  according  to  the 
company's  protocol.  Transfected  cells  were  selected  and 
maintained  with  hygromycin  at  150pg/ml  in  DMEM 
(Cellgro,  Herndon,  V.A,  USA).  After  the  selection,  stable 
transfectants  w'ere  pooled  and  e.\pa tided  for  use  in 
e.xperiments.  Successful  integration  of  the  reporter  con¬ 
struct  at  the  lip  recombinase  target  was  inferred  from  the 
phenotype  of  the  cells  (gain  of  hygromycin  resistance,  loss 
of  Zeocin®'  resistance  and  gain  of  liiciferase  expression). 

Transient  transfection  and  luciferase  assay 

CCRF-CEM  cells  in  exponential  grow'th  phase  were 
plated  at  a  density  of  6  x  10'’  cells/well  of  a  24-W'ell  plate 
in  500 pi  OPTI  MEM  (Invitrogem  Carlsbad,  CA.  USA). 
Lipofectamine  2000  (Invitrogen,  Carlsbad,  CA,  US,4)  was 
used  as  a  transfection  reagent,  and  transient  transfection 
was  conducted  according  to  the  manufacturer’s  protocol, 
using  lOng  of  the  renilla  luciferase  reporter  plasmid  pRL 
SV40  and  1  pg  of  the  c-myb  reporter  plasmids.  Firefly  and 
renilla  luciferase  activities  were  measured  by  using  a  dual 
luciferase  reporter  assay  system  (Promega,  Madison,  VVI, 
USA)  24  h  after  transfection.  Relative  luciferase  activities 
w-ere  acquired  by  normalizing  the  ratio  of  the  firefly 
luciferase  activity  to  the  renilla  luciferase  activity  of  the 
mutant  construct  w-ith  that  of  the  wild-type  construct. 
Each  transfection  of  the  c-myb  report  plasmids  was  done 
in  duplicate,  repeated  at  least  three  times,  Flp-In  293 
stable  cells  w-ere  plated  at  50%  confiuency  in  a  complete 
medium,  and  0-l.2pg  of  the  FLAG-tagged-M  AZ  expres¬ 
sion  plasmid  BRB112  were  transfected  by  using 
Lipofectamine  2000.  Twenty-four  hours  after  transfection, 
cells  were  lysed  and  luciferase  activities  w-'ere  measured. 
Since  luciferases  in  Flp-In  293  stable  cells  are  stably 
expressed,  total  protein  amount  in  the  lysate  was  u.sed  to 
normalize  luciferase  activity  of  the  samples.  Each  trans¬ 
fection  of  the  c-myb  reporter  plasmids  was  done  in 
triplicate,  repeated  at  least  three  times.  The  data  are 
presented  as  the  average  relative  luciferase  activity 
compared  to  the  w-ild-type  c-myb  promoter  from  six 
luciferase  assays  per  plasmid. 

Nuclear  extract  preparation 

K562  cells  were  harvested,  resuspended  in  RSB  bulfer 
(lOmM  NaCl,  3  niM  MgCL,  lOmM  Tris-HCl,  pH  7.5, 
0.05%o  NP-4()  and  protease  inhibitors),  vortexed.  incu¬ 
bated  on  ice  and  spun  dow-ii.  The  pellet  was  washed  once 
With  RSB  bulfer  and  resuspended  in  RIP.A  bulfer  (50  rnM 
Tris-HCl  (pH  7.5),  I'Vo  NP-40,  0.25%  sodium  deoxycho- 
late.  150mM  NaCl.  1  mM  EDTA  and  protease  inhibi¬ 
tors).  Then,  samples  were  vortexed,  incubated  on  ice  and 


spun  down.  The  supernatant  containing  nuclear  proteins 
was  stored  at  — 80'C, 

Expression  of  FL.\G-M  AZ 

FLAG-tagged-MAZ  was  expressed  from  the  plasmid 
BRB112  in  BL21  (DE,3)  pLysS  cells  (Invitrogen.  Cedar 
Creek.  TX.  LIS.A)  according  to  the  manufacturer's 
protocol.  Briefly,  BRB112  w-as  transformed  into  BL2I 
(DE3)  pLysS  cells.  Overnight  culture  was  prepared  from  a 
single  colony  and  diluted  to  1:100  in  fresh  Liiria-Bertani 
broth  with  0,4%i  gluco.se.  Pre-  and  post-induction  incuba¬ 
tions  were  2h  each  at  37%),  and  1  mM  of  IPTG  was  irsed 
for  induction  of  FLAG-MAZ  expression.  After  induction, 
bacterial  cells  were  harvested  and  lysed,  and  FLACi-MAZ, 
was  purified  by  a  FL.AG  affinity  column  according  to  the 
manufacturer’s  protocol  (Sigma-.Aldrich,  St  Louis,  MO, 
USA),  MAZ  was  also  synthesized  by  using  the  TnT'*' 
coupled  Reticulocyte  lysate  systems  according  to  the 
manufacture’s  protocol  (Promega,  Madison,  VVI,  USA). 
Briefly,  2  pg  of  the  plasmid  BRBl  12  was  used  as  a  DNA 
template  in  a  final  volume  of  50  pi,  and  coupled 
transcription/translalion  reaction  was  conducted  at  30"C 
for  90  min. 

Electrophoretic  mobility  shift  assay  (E.MS.4) 
and  competition  F.MSA 

For  EMSA  w-ith  purified  FLAG-MAZ,  1  pg  of  purified 
FLAG-MAZ  was  incubated  in  a  binding  bufl'er  (lOmM 
Tris-HCl  (pH  7.5),  50  rnM  NaCl,  1  mM  dithiothreitol 
(DDT),  1  mM  EDTA  (pH  8.0)  and  5%  glycerol)  m  the 
presence  of  0.2pmol  of  labeled,  double-stranded  probes 
and  incubated  for  20  min  at  room  temperature.  For 
EMSA  with  MAZ  synthesized  by  the  TnT"'  coupled 
Reticulocyte  lysate  system,  the  previously  described 
methods  in  (26)  were  followed.  .A  total  of  10  pi  of 
reticulocyte  lysate  containing  newly  syntliesized  MAZ 
was  used  per  sample  in  the  presence  of  the  labeled  probes 
at  12.5  nM.  For  competition  EMS.As.  5  pg  of  K562 
nuclear  extract  w-as  incubated  with  0,  5  and  20pmol  of 
double-stranded  cold  competitors  in  the  binding  bufl'er 
w-ith  1  pg  of  dldC  for  20  min  at  room  temperature,  and 
then  ().2pmql  of  the  labeled  probes  were  added  to  the 
reaction  and  incubated  for  20 min  at  room  temperature. 
Probe-protein  complexes  were  separated  on  a  4%i  non¬ 
denaturing  gel  with  2.4%  glycerol  in  (),25X  TBE  at  4’C. 

DNase  I  protection  assay 

,A  260  bp  GG.A  repeal  region  probe  was  prepared  by 
digesting  pMybW’T  with  Banll  and  Sfcl  and  labeling  by 
Klenow  filling.  .Approximately  LSOOOcpm  of  the  probe 
w-as  incubated  in  a  binding  bulTcr  (25  mM  Tris-HCl 
(pH  8,0),  50 mM  KCl,  6.25  mM  MgCL,  0.5  mM  EDTA 
(pH  8.0).  10“/o  glycerol,  0,5  mM  DTT)  for  10  min  on  ice 
in  the  presence  of  FLAG  peptides,  or  FL.AG-MAZ. 
After  adding  CaCL  and  MgCL  to  2,5  niM  and  5mM  as 
final  concentrations,  respectively,  0.0 hi  of  DNase  I 
(Invitrogen.  Carlsbad,  C.A.  LNA)  w-as  added  to  each 
sample  and  incubated  2  min  at  room  temperature.  DNase 
I  reaction  was  quenched  by  addine  a  stop  soliiiion 
(lOOmM  NaCl.  15mM  EDTA.  0.5%  SDS,  SOgganl 
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yeast  tRNA  as  final  concentrations).  The  labeled  probes 
were  recovered  via  phenol/chloroforni  extraction  followed 
by  ethanol  precipitation  and  separated  on  a  b'N  denatur¬ 
ing  gel. 

Statistical  analysis 

Microsoft  Excel  was  used  to  perform  statistical  analysis. 
Unpaired,  two-tailed  t-tests  were  used  to  determine 
statistical  differences  in  relative  lucifera.se  activities  of  the 
wild-type  and  mutant  c-myb  reporter  constructs,  and  a 
value  of  /’<().01  was  considered  significant. 

RESULTS 

The  c-niyb  GGA  repeat  region  forms  TiHiHiT 
G-quadruplexcs 

ODNs  with  the  sequence  {GGA)4  have  been  shown  to  form 
a  secondary  structure  composed  of  a  guanine  tetrad 
stacked  on  a  guanine-adenine  heptad.  Two  tetrad:heptad 
(T;H)  G-quadruplexes  can  dimerize  by  stacking  interac¬ 
tions  on  their  heptad  planes  to  form  a  T:H:H:T  structure 
(Figure  IB,  Uj).  taken  from  reference  19)  (19,20).  This 
T:H:H:T  G-quadruplex  can  be  derived  intramolecularlly 
from  a  (GGA)x  ODN  and  was  shown  to  be  very  stable, 
producing  a  melting  temperature  of  86"C  (22).  In  this 
structure,  the  two  T:H  G-quadruplex-fonning  core 
sequences.  GGAGGAGGAGG,  are  linked  by  a  single 
adenine  nucleotide  that  does  not  form  any  hydrogen  bonds 
to  stabilize  the  T:Ff;H:T  higher  order  structure  (Figure  1  B, 
right,  modified  from  reference  21)  (21,22). 

Since  the  c-myb  promoter  contains  three  nearly  perfect 
tandem  (GGA)4  repeats  (Figure  I  A),  we  speculated  that 
each  could  be  considered  a  T:H-forming  unit  and  would 
form  T:FI:H:T  structures  w'ith  slightly  longer  linker 
sequences  than  in  the  previously  characterized  (GGA)s 
ODN  (Figure  I  B,  right,  modified  from  reference  21).  We 
performed  DMS  footprinting  studies  with  an  ODN 
containing  all  three  GGA  repeat  regions  (ODN  I)  and 
ODNs  in  W'hich  each  region  was  mutated  to  (GCA)4  to 
prevent  secondary  structure  formation  (ODNs  II,  III  and 
IV)  (Figure  2.A).  These  studies  show  potassium-dependent 
partial  and  complete  DMS  footprints  in  R1  and  R2  in 
ODN  I,  respectively,  and  in  the  two  unaltered  GGA 
repeat  regions  of  ODNs  II,  III  and  IV  (Figure  2.4).  This 
pattern  of  DMS  protection  suggests  that  T:H:H:T 
structures  are  formed  w'ithin  the  c-myb  GGA  repeats, 
and  further  suggests  that  any  two  of  the  three  (GGA)4 
repeats,  such  arR2/R3  of  ODN  II.  R1/R3  of  ODN  III 
and  R1/R2  of  ODN  IV,  can  form  the  dimer  structure, 
implying  that  there  is  a  possible  competition  for  dimeriza¬ 
tion  among  the  three  GGA  repeat  regions.  Significantly, 
RI/R3  in  ODN  III  and  R1/R2  in  ODNs  I  and''lV  form^a 
strong  T:H:H:T  dimer  and  R2,  R3  in  ODN  II  forms  a  less 
stable  T:H:H:T.  Stronger  protection  of  R1  in  ODN  1 
compared  to  ODN  III  is  probably  explained  by  competi¬ 
tion  for  dimerization  partners  in  ODN  I.  where  three 
T:H;H:T  structures  (R1/R2,  R1/R3  and  R2/R3)  can 
potentially  form  and  probably  exist  in  equilibrium.  We 
found  that  the  R1  and  R3  GGA  repeat  sequences  are 


perfectly  conserved  across  multiple  mammalian  species, 
including  mice,  rats,  dogs,  cowss  and  humans  (Figure  lA), 
suggesting  that  R1  and  R3  are  evolutionarily  conserved  to 
preserve  a  DN.4  secondary  structure  that  controls  myb 
expression. 

CD  has  been  used  to  deduce  the  relative  orient¬ 
ation  (parallel  vs  antiparallel)  of  DNA  strands  in 
G-quadruplexes  (27).  and  most  forms  of  G-quadruplexes 
produce  very  characteristic  CD  spectra.  The  CD  spectrum 
of  a  T:H:H:T  from  a  (GGA)g  oligonucleotide  (ODN  V)  is 
shown  in  Figure  2B.  and  essentially  reproduces  the  CD 
signature  for  the  T:H:FI:T  structure  that  was  reported  in 
correlation  with  the  NMR  studies  from  w'hich  the 
T:H:H:T  was  identified  (22).  ODN  VI  is  an  ODN 
representing  the  c-myb  promoter  and  containing  GG-4 
repeat  regions  1  and  2  with  the  flanking  sequence  and 
intervening  sequence  between  the*GG.A  repeats  derived 
from  the  c-myb  promoter.  ODN  VI  produces  the 
characteristic  CD  signature  of  a  T:H:H:T  with  a  strong 
peak  at  264  nm  in  the  presence  of  potassium,  indicating 
T:H:FI:T  G-quadruplex  formation  by  the  ODN,  In 
order  to  verify  that  the  CD  signature  of  ODN  VI  is 
due  to  G-quadruplex  formation  by  the  ODN,  we 
tested  a  negative  control  oligonucleotide  (ODN  VII) 
with  7-deazaguanine  analog  substitutions  to  prevent 
G-quadruplex  formation,  PPG  is  a  pyrazolopyrimidine 
analog  of  guanine  in  which  the  N7  and  C8  atoms  of 
guanine  are  interposed.  NT-dependent  secondary  DNA 
structures,  such  as  G-quadruplexes,  are  usually  not 
formed,  but  the  PPG  retains  the  same  electron  density  in 
the  ring  system  as  guanine,  and  PPG  substitutions  do  not 
alter  the  optical  activity  (absorption  of  UV  light) 
compared  to  the  unfolded  form  of  the  guanine  oligo.  As 
shown  in  Figure  2B,  the  PPG-substituted  oligonucleotide 
ODN  VII  did  not  produce  the  CD  signature  of  a  T:H:H:T. 
confirming  that  the  CD  signature  of  ODN  VI  is  due  to 
T;H:H:T  formation.  ODNs  I  through  IV  also  produce  CD 
spectra  characteristic  of  a  T;H:H:T  G-quadruplex  in  a 
potassium-dependent  manner  (Figure  2C).  The  T:FI:H:T 
stability  inferred  from  the  signature  peaks  agrees  with  the 
DMS  footprinting  results,  showing  the  lowest  signature 
peak  on  ODN  II  and  high  peaks  on  ODNs  III  and  IV. 
When  taken  together  with  the  other  biochemical  studies 
presented  here,  the  CD  spectra  provide  strong  supporting 
evidence  for  the  formation  of  a  T:H;H:T  structure  by  the 
GGA  repeat  regions  of  the  c-myb  promoter. 

DNA  and  RNA  polymerase  stop  assays  have  been  used 
as  a  means  to  show  the  relative  stability  of  DNA 
secondary  structure  formation  in  a  DN.A  template 
strand  and  to  deduce  the  location  and  stability  of  G- 
quadruplexes  in  single-stranded  templates  and  plasmid 
DNA  (28,29).  Because  the  GGA  repeat  region  of  c-myb 
occurs  on  the  transcribed  strand  downstream  of  the 
transcription  start  site,  RNA  polymerase  arrest  could 
occur  in  the  c-myb  promoter  if  a  stable  G-quadruplex 
structure  was  formed.  Thus,  both  DNA  and  RN.A 
polymerase  arrest  sites  at  these  sequences  are  potentially 
relevant  to  the  disruption  of  enzymatic  function  that  could 
be  encountered  in  cells  by  G-quadrupIex  formation  in  the 
c-myb  GG.4  repeat  region.  The  results  show  that  DN.A 
polymerase  (Figure  3A  and  B)  and  RNA  polymerase 
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Figure  2.  DMS  protection  assay  and  CD  spectra.  (A)  DMS  nicthylation  patterns  are  indicated  by  open  circles  (methylated),  hall'-shadcd  circles 
(partially  methylated)  and  full-shaded  circles  (unmcthylalcd).  The  wild-type  GG.A  repeat  sequence  (ODN  !)  shows  protection  from  DMS  mcihylation 
in  R1  and  R2  in  the  presence  t)f  potassium.  The  (GGA)4  to  (GCA)a  mutants  (ODNs  H,  11!  and  IV)  show  protection  in  the  two  intact  (GGA)4 
repeats  in  the  pre.scncc  of  potassium.  Note  that  Ps  in  ODN  Vll  means  PPGs.  (B)  Circular  dichroism  spectra  of  ODNs  V.  V]  and  VII  in  the  presence 
of  KCl,  ODNs  V  and  V]  show  CD  signature  of  G-quadriiplcx  but  not  ODN  Vll.  (C)  CD  spectra  of  ODNs  i,  11.  Ill  and  iV,  ODN  11  show's  the 
lowest  G-quadrupicx  sicnaturc  peak  and  ODNs  Hi  and  IV  show  a  high  signature  peak. 


(Figure  .3C)  arrest  sites  (designated  SI  and  S2)  are 
observed  precisely  at  the  beginning  of  the  third  and 
second  GGA  repeat  regions  in  a  potassium-dependent 
manner.  These  results  are  explained  by  the  polymerase 
arrest  at  SI  from  T:H:H:T  by  the  R2/R-3  and/or  R1/R3 
regions  and  the  arrest  at  S2  by  the  R1/R2  GGA  repeat 
regions.  The  presence  of  two  arrest  sites  indicates  that  the 
c-myb  promoter  can  form  a  mixture  of  stable  secondary 
structures  due  to  the  competition  between  GGA  repeal 


regions.  In  addition,  our  OMS  footprinting  data  showing 
DMS  cleavage  in  both  Rl  and  R3  regions  of  ODN  ) 
(Figure  2A)  support  our  conclusion  that  there  is  an 
equilibrium  between  R1/R2  and  R2/R3  that  results  in 
polymerase  arrest  at  S2  and  SI,  respectively.  Furthermore, 
our  findings  imply  that  polymerase  was  not  arrested  by  a 
T:H.  because  otherwise  there  would  be  an  arrest  site 
before  Rl.  Significantly,  in  the  full-length  (FL)  c-myb 
GGA  repeat  templates  (ODNs  XI  and  XXl),  polymerase 
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Figure  3.  RNA  and  DNA  polymerase  arrest  assays.  (A)  The  vvild-lype  and  regional  mutants  ol'  the  e-myb  GGA  repeat  region  show  pota.ssium- 
dependent  Taq  polymerase  arrest  sites.  The  wild-type  GGA  repeat  sequence  (ODN  XI)  eauses  two  arrest  sires  before  R3  and  R2.  and  the  regional 
mutants  show  a  single  arrest  site  before  the  lirsi  intact  (GGA)4.  These  hands  were  quantified  by  densitometry,  and  the  percentage  of  arrest  products 
is  shown  in  the  right  pane!  of  A.  Note  that  the  regions  mutated  from  (GGA)4  to  (GCA)4  are  indicated  as  gray  ovals.  vS,  SI,  and  S2  are  DNA  pol  stop 
arrest  sites,  B  and  C.  DNA  (B)  and  RNA  fC)  polymerase  arrest  assays  w'ilh  the  regional  deletion  mutants  of  the  c-myb  GGA  repeat  region.  Deletion 
of  R1  (ODNs  XV  and  XXV)  result  in  a  single  arrest  site  at  Si.  Deletion  of  R1  and  R2  (ODNs  XVI  and  XXVI)  arrests  RNA  polymerase  at  SI  but 
not  DNA  polymerase.  The  location  of  the  arrest  .sites  (SI  and  S2)  arc  shown  schematically  relative  to  GG.A  repeat  regions  (RI.  R2  and  R3).  the 
primer  binding  site  (P).  and  the  FL  extension  product  in  the  diagram  adjacent  to  the  aiitoiadiogram. 
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CCRT-CFM  cells.  The  Rl,  R2  and  R3  deletion  mutam  pMybDelRl  2  3  markedly  reduces  Inciferasc  aciiviiv.  ‘/'.cO.Ol 


arrest  at  the  first  arrest  site  (SI)  is  only  partial,  and  the 
second  arrest  site  (S2)  is  more  pronounced,  consistent  with 
the  data  from  the  DMS  protection  assays  that  the 
T:H:H:T  formed  between  GGA  repeat  regions  I  and  2  is 
the  most  preferred,  when  all  three  regions  are  intact. 

When  one  of  the  GGA  repeal  regions  is  mutated  (ODNs 
XII.  XIII  and  XIV  in  Figure  3A),  T:H:H:T  I'ormalion 
occurs  with  the  two  intact  GGA  repeals  and  results  m  a 
single  arrest  site  at  either  SI  or  S2.  This  observation  is 
comparable  to  the  single  arrest  site  (S)  that  is  observed 
before  the  GGA  repeats  in  a  template  containing  the 
(GGA)s  sequence  (ODN  XVII  in  Figure  3B)  that  w'as  used 
as  a  positive  control  for  the  T;FI:H:T  structure  (22),  and 
indicates  that  any  two  (GGA)4  repeats  can  form  a  T:H:H:T 
regardless  of  their  positions  in  the  c-myb  GGA  repeal 
region.  Of  note,  although  any  two  (GGAlu  motifs  are  able 
to  form  a  T:H:H:T,  the  relative  amount  of  arrest  product 
compared  to  FL  primer  extension  products  varies  in  ODNs 
XI.  XII.  XIII  and  XIV.  which  provides  an  estimate  of  the 
stability  of  the  T:H;H:T  structures  formed  by  the  GGA 
repeat  sequences.  For  e.xample.  in  Figure  3A,  the  arrest 
products  at  SI  and  S2  are  substantial  compared  to  the 
amount  of  the  FL  primer  extension  product  in  ODN  XI 
(representing  the  WT  myb  sequence),  whereas  the  arrest 
product  at  SI  in  ODN  .Xll  is  only  ptirtial  compared  to  the 
FL  extension  product.  Our  D.VIS  protection  assay  data 
(Figure  2A)  and  the  percentage  of  arrest  products 
(Figure  3A)  demonstrate  that  mutation  of  GGA  repetit 
region  I  leads  to  a  weak  T:H:Li:T  structure  formed  between 
R2  and  R3. 

Deletion  studies  in  DNA  and  RNA  polymerase  arrest 
assays  demonstrated  that  dimerization  of  two  T:H 
G-quadruplexes  is  required  for  stable  quadruplex  forma¬ 
tion.  When  one  of  the  GGA  repeat  reeions  is  deleted 
(ODNs  XV  and  XXV  m  Figure  3B  and  C).  T:H:H:T 
formation  takes  place  by  dimerization  of  two  T:H 
quadruplexes  and  causes  a  single  arrest  site  in  both 
DNA  and  RNA  polymerase  assays.  However,  when  two 
GGA  repeat  regions  are  deleted  (ODNs  XVI  and  XXVI), 
a  single  T;H  structure  could  form,  and  this  structure 
partially  arrests  RNA  polymerase  (Figure  3C)  but  not 


DNA  polymerase  (Figure  3B),  suggesting  that  the  T:H 
structure  is  not  as  stable  as  the  T:H:H:T  structure,  in 
support  of  the  concept  that  stacking  interactions  between 
the  heptad  planes  of  two  T:H  subunits  lead  to  a  more 
stable  higher  order  T:H:H:T  structure. 

In  summary,  these  biochemical  assays  demonstrate  that 
two  (GGA)4  motifs  from  the  c-myb  promoter  can  form  a 
stable  potassium-dependent  secondary  DN.V  structure 
capable  of  arresting  the  progress  of  DNA  and  RNA 
polymerases  and  provide  evidence  for  the  formation  of  a 
T:H:H:T  structure  by  the  c-myb  promoter. 

The  c-myb  GGA  repeat  region  plays  a  critical  role  in 
c-myb  promoter  activity 

In  order  to  investigate  the  role  of  the  GGA  repeat  region  in 
c-myb  promoter  activity,  we  performed  luciferase  assays 
using  wild-type  and  mutant  c-myb  promoters.  The  reporter 
plasmid  pMybW'T  bears  the  wild-type  c-myb  promoter 
from  — 7I9  to  +200  with  the  three  intact  GGA  repeat 
regions.  The  mutant  reporter  plasmids  pMybDelRL 
pMybDelR2  and  pMybDelR3  are  deletion  mutants  in 
which  a  (GGA)4  motif  has  been  individually  deleted  as 
shown  in  Figure  4.  We  also  generated  serial  deletion 
mutam  plasmids,  pMybDelRl  2  and  pMybDelRl, 2  3,  in 
which  two  or  all  three  GCiA  repeat  regions  have  been 
deleted.  Figure  4  shows  relative  firefly  luciferase  activities  in 
CCRF-CEM  cells  normalized  by  renilla  luciferase  activities 
in  a  dual  luciferase  assay  system.  Deletion  of  GG.A  repeat 
region  I  or  both  regions  I  and  2  m  the  c-myb  promoter 
increased  relative  luciferase  activities  by  3-  to  3-fold.  In 
OD-Ns  representing  the  wald-type  c-myb  sequence,  Rl  and 
R2  were  the  tw’o  GGA  repeat  regions  involved  in  T:H:H:T 
formation,  and  deletion  of  Rl  or  Rl  2  was  shown  to  result 
in  an  unstable  secondary  structure  with  the  remaining 
region(s).  In  contrast,  deletii.m  of  R2  has  no  significant 
effect  on  c-myb  expression.  In  ODNs,  mutation  of  R2 
resulted  in  a  stable  T:H:H:T  between  Rl  ;tnd  R3.  Taken 
together,  these  results  support  the  conclusion  that  stable 
T:H:H:T  formation  involving  two  GGA  repeat  regions 
represses  c-myb  expression.  The  deletion  of  region  3  also 
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significantly  increased  c-myb  promoter  activity,  even 
though  corresponding  mutations  in  R3  in  ODNs  repre¬ 
senting  the  c-myb  promoter  resulted  in  a  stable  T:H:H:T 
between  R1;R2.  TheelTect  of  deleting  13.3  may  be  explained 
by  the  newly  created  transcription  factor  binding  site  that 
is  introduced  as  a  result  of  deletion  of  R3  and  then 
subsequent  rejoining.  We  found  such  a  transcription 
factor  binding  site  in  pMybDelR3,  ILF  I  (winged-helix 
transcription  factor  IL-2  enhancer  binding  factor),  by  using 
the  computer  program  Matinspector  from  GenoMatix 
(vvvvvv.genomatix.de).  ILFl  is  constitutively  expressed  and 
serves  as  a  transcriptional  activator  of  the  IL-2  promoter  in 
the  T-cell  leukemia  cell  line  Jurkat  (30),  Since  the  cell  line 
CCRF-CEM  used  in  this  study  is  a  T-cell  leukemia  cell  line, 
it  is  possible  that  the  ILFl  site  created  in  the  GG.A  repeat 
region  of  pMybDelR3  may  act  as  a  transcriptional 
activator,  w’hich  W'ould  result  in  an  increase  of  promoter 
activity  of  pMybDelR3,  We  also  examined  the  sequence  of 
pMybDelRl  and  pMybDelRl  .G  to  see  if  there  are  any 
transcription  sites  created  as  a  result  of  deletion  of  R1  or 
both  R1  and  R2.  However,  Matinspector  did  not  find  any 
newly  introduced  transcription  sites  due  to  the  deletion. 
There  are  transcription  factor  binding  sites  that  have  been 
lost  in  the  deletion  mutant  constructs  pMybDelRl, 
pMybDelR2  and  pMybDeIR  I  /2.  Nevertheless,  considering 
luciferase  activity  of  pMybDel  R I  /2/3,  the  loss  of  transcrip¬ 
tion  factor  binding  sites  does  not  seem  to  cause  an  increased 
promoter  activity. 

Although  deletion  of  Rl,,/2  and  R3  individually  increased 
c-myb  promoter  activity,  the  additional  15  bp  deletion 
of  region  3  from  pMybDelRl/2,  vvliich  results  in  complete 
removal  of  all  the  GG.A  repeats  from  the  c-myb  promoter, 
showed  almost  complete  loss  of  luciferase  activity 
in  CCRF-CEM  cells  that  ranges  only  1  3%  of  the 
pMybWT  construct.  It  is  important  to  note  that  luciferase 
activity  was  still  measurable  (~5()-fold  over  background) 
from  the  pMybDelRl/2/3  plasmids.  A  very  low  level  of 
c-myb  expression  in  response  to  deletion  of  the  entire  GGA 
repeat  region  w'as  also  observed  in  two  other  leukemia  cell 
lines,  Jurkat  and  K562  (data  not  shown).  Because  of  the 
striking  loss  of  promoter  activity  by  deletion  of  the  GGA 
repeats  in  three  cell  lines  tested,  another  pMybDelRl ,'2/3 
construct  was  tested  to  control  for  spurious  effects  from 
site-directed  mutagenesis,  and  these  results  were  confirmed 
with  a  second  construct  plasmid.  At  least  one  (GG.A)4 
repeat  region  is  required  for  c-myb  promoter  activity,  and 
we  speculate  that  duplication  of  the  (GGA)4  motif  is  u.sed 
by  the  myb  promoter  to  finely  tune  c-myb  expression  by 
equilibrium  between  secondary  structure  formation  and 
transcription  factor  binding  to  double-stranded  DNA. 
Collectively,  these  data  show  that  the  c-myb  GG.A  repeats 
are  critical  for  regulating  c-myb  expression  and  suggest  that 
T:H:H;T  secondary  DN.A  structures  formed  by  adjacent 
(GG.A)4  motifs  are  likely  negative  regulators  of  the  c-myb 
promoter. 

The  transcription  factor  -M,4Z  binds  to  the  c-myb  GGA 
repeat  region 

Since  our  data  demonstrated  that  the  c-myb  GGA  repeat 
region  is  a  critical  cis  element  for  controlling  c-myb 


promoter  activity,  we  investigated  the  possible 
protem:DN,A  interactions  m  the  region.  First,  we  searched 
for  putative  transcription  factors  for  the  c-myb  GG.A 
repeat  region  from  +2  to  73  with  Matinspector 
(Supplementary  Data).  M.AZ,  RXR,  PU.l  and  VDR 
were  identified  as  transcription  factors  that  have  multiple 
potential  binding  sites  in  the  GGA  repeat  region.  We 
performed  EMSAs  to  evaluate  the  binding  of  nuclear 
proteins  from  the  leukemia  cell  line  K562  to  the  myb 
GG.A  repeats  in  the  presence  of  competitor  oligos  with  the 
consensus  binding  sites  for  these  transcription  factors. 
Since  the  setiuence  of  the  GG.A  repeat  region  is  repetitive, 
we  divided  the  GG.A  repeat  region  into  three  regions 
(VIII,  IX  and  X),  each  of  which  includes  one  of  the 
(GGA)4  repeat  regions  (Rl,  R2  and  R3.  respectively)  for 
competition  EMS.As  (Figure  5B)  and  forms  a  stable 
duplex  with  a  7’m>64"C.  In  the  cbmpetilion  FiMSA  for 
investigating  M.AZ  binding,  w'e  first  incubated  K562  cell 
nuclear  extract  with  0.  25X  or  100.X  molar  excess  of  a  non- 
labeled  double-stranded  competitor  bearing  a  consensus 
M.AZ  binding  site  or  mutant  MAZ  binding  site,  and  then 
the  labeled  double-stranded  probes  ODN  VIII,  IX  and  X, 
representing  the  three  (GGA)4  regions  of  the  c-myb 
promoter,  were  added  to  the  reaction.  The  competition 
EMS.As  ill  Figure  5B  demonstrate  that  the  c-myb  GGA 
duplex  repeat  region  interacts  with  multiple  nuclear 
proteins,  but  several  of  the  protein: DN.A  complexes  were 
eliminated  by  the  MAZ  consensus  competitor  but  not  the 
MAZ  mutant  competitor.  We  also  used  consensus  binding 
sequences  for  PLfl,  RXR  and  VDR  in  competition 
EMS.As.  These  data  are  summarized  in  Figure  5C.  and 
the  other  competition  EMS.As  are  shown  as 
Supplementary  Data.  These  data  provide  preliminary 
evidence  that  MAZ.  PU.l,  RXR  and  VDR  have  one  or 
more  binding  sites  in  the  c-myb  GG.A  repeats. 

To  confirm  MAZ  binding  to  the  c-myb  promoter,  we 
purified  MAZ  protein  for  EMS.As  and  DNase  I  footprint¬ 
ing  studies.  Figure  6, A  show's  that  MAZ  binds  to  all  three 
duplex  probes  representing  the  three  GGA  repeat  regions 
of  the  c-myb  promoter  and  the  tw'o  M.AZ-probe 
complexes  indicates  that  M.VZ  may  bind  to  the  duplex 
probes  as  a  monomer  or  a  dimer,  DNase  I  digestion  was 
performetJ  on  a  260  bp  fragment  of  the  c-myb  promoter  in 
the  presence  of  either  a  negative  control  FLAG  peptide 
(Figure  6B.  lane  2)  or  purified  FL.AG-M.AZ  (Figure  6B. 
lane  3).  Only  FLAG-MAZ  showed  protection  from 
DNase  I  digestion  spanning  the  GG.A  repeat  region  but 
not  F’L.AG  peptide,  confirming  our  competition  EMS.A 
data  that  MAZ  has  multiple  binding  sites  in  the  c-myb 
GG.A  repeats. 

MAZ  binds  to  the  G-qiiadruplexes  formed  in  the  c-myb 
GG.A  repeat  region  and  represses  promoter  activity 

M.AZ  was  reported  to  bind  to  the  G-quadruplex  formed 
by  the  G-rich  region  of  the  diabetes  susceptibilitv  locus 
IDDM:  126),  To  test  if  MAZ  can  also  bind  ‘to  the 
G-quadruplexes  formed  in  the  c-myb  GGA  repeat  region, 
we  performed  EMSAs  with  ODNs  folded  into  T:H:H:T 
structures  in  the  presence  of  M.AZ  in  rabbit  reticulocyte 
lysate  as  de.scnbecl  in  (26).  First,  w'e  tested  ODN  VI,  which 
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Figure  5,  Competition  CMSA  with  K562  nuclear  extract.  (A)  The  DN.A  probes  used  for  EMSAs  arc  illustrated  under  the  corresponding  GGA  repeal 
sequence.  The  probes  arc  double  stranded,  but  only  the  G-rich  sequence  is  shown.  (Bl  The  results  from  competition  E-iMSAs  using  a  MAZ  competitor 
arc  shown.  The  protcin:DN,-\  complexes  completely  or  almost  completely  eliminated  by  the  M.AZ  comgK'tiior  are  indicated  with  arrow's. 
(C)  Summary  of  competition  EMSA  results.  The  putative  transcription  factors  shown  to  interact  or  not  to  intcrtict  with  the  e-myb  GG.4  repeat 
region  in  competition  EMSAs  tire  listed. 


contains  e-myb  GG.4  repeal  regions  1  and  2  and  should 
form  a  more  stable  T:H:H:T  than  ODN  I  due  to 
the  lack  of  competition  for  dimerization  in  ODN  VI. 
ODN  V  (perfect  (GGA)k  sequence)  and  ODN  Vll  (PPG- 
substituled  analog  of  ODN  VI)  were  studied  as  positive 
and  negative  controls,  respectively.  Figure  7A  shows  a 
MAZ  proteiniDNA  iiileractioii  that  is  specific  for  the 
T:H:H:T  structure.  The  band  denoted  as  the  T:H:H:T- 
MAZ  complex  is  observed  with  ODNs  V  and  VI,  but  not 
with  ODN  VII,  the  PPG-substituled  control,  and  only  in 
the  presence  of  MAZ,  Non-specific  proteimDNA  interac¬ 
tions,  presumably  from  proteins  in  the  reticulocyte  lysate, 
are  seen  with  all  three  ODNs  in  the  presence  or  absence  of 
MAZ,  We  also  confirmed  that  M.AZ  binds  to  the 
G-quadruplex(es)  formed  by  ihe  native  c-myb  pronioler 
with  all  three  (GG.Alq  repeats  (Figure  7B).  This  result 
confirms  that  MAZ  binds  to  the  equilibrium  structiire(s) 
formed  when  all  three  GG,4  repeat  regions  are  present,  but 
the  free  probe  migrates  more  slowly  than  ODNs  V,  VI  and 
VII  because  the  ODN  is  longer  (Supplementary  Data), 

We  investigtited  the  role  of  M.AZ  in  regulation  of  c-myb 
expression  via  transfection  of  the  MAZ  expression 
plasmid  BRBI12  in  Flp-la  293  Myb  WT  reporter  stable 
cells  (Figure  8).  These  Flp-in  273  cells  contain  ihe  stably 
transfected  wild-type  c-myb  promoter  and  luciferase 


coding  sequence  in  the  genome,  which  are  originally 
from  pMybWT  (Figure  8).  Forced  overexpression  of 
MAZ  decreased  the  activity  of  the  stably  transfected  myb 
promoter,  demonstrating  that  MAZ  is  a  repressor  of  the 
c-myb  promoter.  We  also  treated  the  same  myb  stable  cells 
with  a  pool  of  three  MAZ  siRNfAs  to  evaluate  the  effect  of 
M.AZ  knockdown  on  c-myb  promoter  activity.  However, 
siRNA  ireatlnent  did  not  change  c-myb  promoter  activity 
(data  not  shown).  It  is  probably  because  many  other 
transcription  factors  interact  with  and  compete  each  other 
for  binding  to  the  c-myb  GGA  repeat  region  (Figure  5). 
Knocking  down  MAZ  would  open  more  opportunity  for 
other  competitor  proteins  to  bind  to  the  region,  which 
may  result  in  no  net  changes  in  c-myb  promoter  activity. 

Collectively,  this  study  presents  the  first  evidence  that  the 
c-myb  GGA  repeat  region  forms  T:H;H:T  G-quadruplexes 
capable  of  binding  to  a  transcriptional  repressor  protein. 
The  GGA  repeat  region  acts  as  a  repressor  of  c-myb 
expression  by  folding  into  a  G-quadruplex  structure  and 
recruiting  transcriptional  repressors  such  as  MAZ,  which 
are  needed  for  basal  transcription. 

DISCUSSION 

In  support  itf  prevuius  N.VIR  studies  (19  22),  our  findings 
from  DMS  pi'oleclion.  CD  and  polymerase  arrest  assays 
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Figure  6.  Electrophoretic  mobility  shift  and  DNase  1  protection  assays 
with  ELAG-MAZ.  (A)  EMSA,  The  complexes  of  FLAG-MAZ  and  the 
c-myb  probes  aie  indicated  witit  arrows.  (B)  DNase  I  piancction  assay. 
Lane  1:  sequencing  for  guanines,  lane  2:  FL.AG  peptide  and  lane  3: 
FLAG-MAZ,  The  area  protected  from  DNase  I  digestion  is  marked 
with  a  solid  line. 


show  that  the  GG.A  repeal  region  in  the  c-myb  promoter  is 
the  first  example  of  a  genomic  .sequence  that  forms 
T;H:H:T  G-quadruplex-like  structures,  involving  two 
adjacent  (GGAjq  repeats.  Several  C-quadruplex  regions 
have  been  identified  in  promoters  and  shown  to  be  critical 
for  regulation  of  promoter  activity.  The  G-quadruplexes 
formed  in  the  c-myc  and  Kras  promoters  inhibit  promoter 
activity  (38.29).  whereas  the  G-qiiadruplex  formed  in  the 
human  insulin  gene  activates  the  promoter  (.31).  which 
suggests  that  these  G-qiiadruplex  structures  can  alter  gene 
expression.  Our  promoter-reporter  transfection  studies 
showed  that  the  c-myb  G-qutidruplex-forming  region  is 
critical  for  regulation  of  c-myb  expression  and  that 
the  deletion  of  (GGA)4  motifs  that  destabilize  the 
G-quadrupIex  increase  promoter  activity.  The  GGA 
repeat  legion  of  the  c-myb  gene  is  highly  conserved 
across  different  mammalian  species  (f  'lgure  LA),  and  the 
R1  and  R3  GGA  repeat  regions  are  absolutely  conserved  in 
all  five  mammalian  genomes  queried.  This  finding  implies 
that  R1  and  R3  are  critical  for  regulation  of  myb 
expression,  and  our  finding  that  deletion  of  either  R1  or 
R.1  significantly  alters  c-myb  transcription  supports  this 
implication.  Conversely.  GGA  repeat  region  R2  is  not 


evolutionarily  conserved,  and  deletion  of  this  region  has  no 
elTect  on  c-myb  transcription  in  our  studies.  The  c-mvb 
GGA  repeat  region  also  contains  numerous  transcription 
factor  binding  sites  in  these  three  GG.A  repeat  regions,  and 
the  c-myb  promoter  may  regulate  its  activity  bv  alterinc  the 
stability  ol  TiHiHiFs  via  changing  dimerization  partners. 

The  mechanism  by  w’hich  G-quadruplexes  can  be 
transcriptional  activators  or  inhibitors  is  not  clear.  There 
are  many  studies  showing  that  promoter  regions  that 
support  G-qiiadruplex  formation  are  usually  also  bindine 
sites  for  one  or  more  important  transcription  factors 
(32  36).  For  instance,  the  c-myc  NHE  G-quadrupIex- 
forming  legion  involves  various  DNA-binding  proteins, 
such  as  the  double-strand  binding  protein  Spl  G2),  the 
single-strand  binding  proteins  CNBP  (33)  and  hnRNP  K 
(34).  and  possibly  yet-to-be-disco\''e]-ed  proteins  that  can 
recognize  the  G-quadruplex  formed  by  this  sequence. 
These  protein  interactions  are  probably  in  fine  balance 
and  competing  lor  potentially  complementary  or  exclusive 
binding  sites.  Interaction  with  transcription  factors 
specific  for  a  certain  DNA  conformation  would  shift  the 
equilibrium  to  the  DNA  conformation  at  the  region, 
which  would  be  critical  for  determining  promoter  activity. 
Supporting  this  model  of  transcriptional  regulation  by 
recognition  of  diflerent  DNA  conformations,  the  ErbB2 
Hr-DNA  formation  in  the  GG.A  repetits  that  is  homo¬ 
logous  to  those  111  the  c-myb  promoter  displaced  the 
binding  of  the  critical  transcription  factor  Ets  in  the 
ErbB2  GGA  repeat  region,  conversely.  Ets  binding 
inhibited  Hr-DNA  formation  (37),  This  observation 
suggests  that  secondary  DNA  structures  and 
proteimDNA  interactions  can  influence  each  other  to 
either  activate  or  suppress  gene  expression.  In  agreement 
with  this  concept,  our  EMSA  results  demonstrate  that  the 
c-myb  GGA  repeat  region  actively  interacts  with  many 
nuclear  proteins  (Figure  5)  when  it  is  double  stranded, 
which  might  mean  that  T:H:H:T  formation  dow nregulates 
promotei  activity  by  simply  displacing  double-strand 
binding  nuclear  proteins. 

The  cmtyb  GGA  repeat  region  is  a  hot  spot  for  nuclear 
piotein  intei action  as  shown  in  Figure  5.  Several  previous 
studies  showed  that  GGA  repeats  in  a  gene  promoter  w'ere 
critical  transcriptional  activator  binding  sites  (23,24). 
Similarly,  our  luciferase  assay  results  demonstrated  that 
the  GG.A  repeat  region  is  essential  tor  c-mvb  promoter 
<icti\ ity.  which  IS  ttboiished  when  the  GG.A  repeats  are 
deleted  (Figure  4),  probably  due  to  the  lo.ss  of  transcrip¬ 
tion  ftictor  binding  sites.  These  results  indicate  that  the 
GGA  repeat  region  has  to  be  available  for  transcriptional 
aaivators  to  bind  to  the  promoter  and  may  imply  that 
G-quadruplex  formation,  if  stabilized  by  a  small  molecule, 
could  also  markedly  inhibit  c-myb  expression,  making  the 
region  unavailable  for  transcription  factor  binding.  The 
discovery  and  development  of  small  molecule  ligands  for 
G-quadruplexes  is  the  subject  ol  investigation  by  several 
research  groups  (38.39). 

We  identified  M.AZ  as  one  of  the  transcription  factors 
that  bind  to  the  GG.A  repeat  region  of  c-myb.  and  show-ed 
thtii  M.AZ  can  bind  to  both  double-stranded  DNA  and 
T:H:H:T  conformations  of  the  c-myb  promoter,  MAZ 
was  first  identified  as  a  GA  box  binding  transcription 
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Figure  7.  MAZ  binding  to  the  G-qtiadrtiplcxcs  formed  by  GGA  repeat  DNA.  (A)  tVISA  with  ODNs  V.  VI  ;ind  VII  in  the  presence  or  absence  of 
M.AZ  in  rabbit  reticiikacytc  lysate.  MAZ  in  rabbit  reticulocyte  lysate  binds  to  the  T:H:H:T  Gwiiiadriiplcxcs  formed  in  OD.Ns  V  and  VI  but  docs  not 
bind  to  the  ODN  Vll  that  cannot  form  a  G-quadriipIcx.  DN.V-protein  complexes  formed  with  M.AZ  or  nonspecific  lysate  protein  and  the  probes  arc 
indicated  with  arrows.  (Bi  EMS.A  with  ODNs  V  and  I.  ODN  V,  a  positive  control  for  .MAZ  binding  to  a  T:H:H:T.  confirms  that  M.AZ  also  binds  to 
the  wild-type  c-myb  GG.A  repetit  sequence  ODN  I.  Due  to  the  difterent  size  of  ODNs  I  and  V.  the  free  probes  show  difierent  mobility. 
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Figure  8.  Transfection  of  the  MAZ  expression  plasmid  BRBIId  into 
the  Flp-ln  293  Myb  WT  reporter  stable  cells.  0  l.2pg  of  BRBI  12  were 
irtinsfecled  into  the  Flp-ln  293  Myb  WT  reporter  stable  cells  ;ind 
lueiferase  activities  were  measured  24  h  after  lixinsfcction.  M  AZ 
overexpression  rcstiltcd  in  inhibition  of  c-myb  promoter  activity. 
■RwO.Ol. 

factor  in  the  c-myc  promoter  and  was  shown  to  control  PI 
and  P2  promoter  activities  of  the  c-myc  gene  (40  43). 
MAZ  is  coiistitutively  expressed  in  various  tissues  (44). 
but  abnormal  MAZ  expression  is  found  in  the  terminal 
phase  of  CML.  suggesting  that  overexpression  of  MAZ 
plays  a  role  in  the  progression  of  CML  (45).  The 
functional  role  of  MAZ  is  dependent  on  the  target  gene, 
and  some  promoters  are  activated  by  MAZ  (46-48),  while 
others  are  repressed  (49,50).  One  possible  explanation  for 
the  dual  roles  of  MAZ  may  be  that  different  cofactors 
interact  with  MAZ  depending  on  the  target  promoter. 
Song  et  (d.  (50)  showed  that  MAZ  acts  as  a  negative 
regulator  via  physical  interaciion  with  HDAC  (histone 
deacetylase  complex),  and  Jordan-Sciutto  e!  ai  demon¬ 
strated  that  MAZ  interacts  with  FACl  (Fetal  Alzheimer’s 
clone  1).  a  truncation  of  the  chromatin  remodeler  BPTF 
(bromodomain  and  PFID  domain  transcription  factor) 
(51,52),  Interestingly.  MAZ  and  the  MAZ-like  protein 
THZif-1  (triple-helix-binding  zinc-finger  proiein-1)  have 


now'  been  show'n  to  bind  to  secondary  DN.4  structures 
(26.53).  The  G-quadruplex  structure  formed  in  the 
IDDM2  locus,  a  critical  regulator  of  1DDM2  promoter 
activity,  is  recognized  by  MAZ;  how'ever,  MAZ  does  not 
bind  to  the  unfolded  form  of  the  G-quadruplex-forming 
region  (26).  Moreover,  the  MAZ-like  protein  THZif-1  w-as 
shown  to  interact  w'lth  the  triplex  formed  in  the  c-myc 
NHE.  w'hich  is  both  crucial  for  c-myc  promoter  activity, 
and  more  recently  investigated  by  several  groups  as 
a  G-quadruplex  forming  element  (28,53  55).  These 
observations  suggest  that  regulation  of  promoter  activity 
by  M.AZ  and  related  proteins  may  involve  recognition 
of  secondary  DNA  structures.  Some  proteins,  such  as 
R.4P1  (Repressor-activator  protein  1),  Thrombin  and 
DNA  topoisomerase  F  have  been  shown  to  stabilize 
G-quadruplexes  (56  58).  Other  proteins,  including  RecQ 
helicases,  RPA  (Replication  Protein  A)  and  POTl 
(Protection  of  Telomere  1),  may  inhibit  secondary 
structure  formation  or  serve  to  resolve  secondary  DNA 
structures  (59  -61).  In  the  case  of  G-qiiadruplexes  formed 
in  gene  promoters,  we  propose  that  these  genes  are 
regulated  by  mechanisms  involving  single-stranded, 
double-stranded  and  folded  DNA-binding  proteins  that 
may  inhibit  or  promote  G-quadruplex  formation. 
Furthermore,  it  is  possible  that  some  proteins  can  bind 
to  more  than  one  DNA  conformations,  as  we  have  shovvn 
W'lth  M.AZ  in  this  study  (Figures  6  and  7).  Since  MAZ  has 
dual  binding  activities  to  both  double-stranded  and 
G-quadruplex  DNA,  regulation  of  promoter  activity  by 
MAZ  might  be  determined  by  the  cofaclors  that  interact 
with  M.AZ.  It  is  not  known  how  MAZ  recruits  different 
cofactors,  but  it  may  depend  on  levels  of  MAZ  in  the 
nucleus  or  on  postlranslational  modifications,  such  as 
phosphorylation  of  M.-AZ.  It  is  also  possible  that  MAZ 
binding  lo  double-stranded  or  G-quadruplex  DN.-A  could 
result  in  changes  in  available  transactivation  domains  of 
M.AZ,  which  may  allow  different  groups  of  transcription 
factors  to  interact  with  MAZ  to  determine  its  role. 
Moreover,  the  c-myb  GGA  repeat  region  is  the  first 
identified  G-qiiadruplex-forming  element  in  a  transcribed 
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Figure  9.  Working  model  for  regulation  ofc-niyb  proiiioicr  aeliviiy  via 
G-quadruplex  formation  in  the  e-niyh  GGA  repeal  region.  (A)  The 
equilibrium  between  the  duplex  form,  opened  form  and  G-quadruplcx 
structure  at  the  c-myb  GGA  repeat  region.  In  the  duplex  form, 
transcriptional  repressors  and  activators  bind  to  the  c-myb  GGA  repeat 
region  in  order  to  regulate  c-myb  expression  levels,  and  MAZ  possibly  acts 
as  a  repressor  of  c-myb  promoter  by  binding  to  this  form.  In  the 
G-quadruplcx  structure,  duplex-binding  proteins  arc  dissociated  from  the 
GGA  repeal  region  and  G-quadruplcx  binding  proteins  regulate 
G-qiiadruplex  stability.  MAZ  may  downrcgulate  c-myb  promoter  ac(iviry 
via  G-quadniplex  binding  at  the  GGA  repeal  region.  (B)  Schematic 
representation  of  G-rieh  strand  of  the  GGA  repeat  region.  When  locally 
unwound  the  c-myb  GGA  repeat  region  could  form  T:H;H:T 
G-quadruplexes  with  any  two  of  three  (GGA)4  repeats  in  the  region. 


region  that  indicates  that  T;H:H:T  formation  in  the  c-myb 
promoter  may  play  a  role  as  a  transcriptional  repressor 
not  only  by  changing  transcription  factor  interaction 
but  also  by  acting  as  a  roadblock  to  transcriptional 
machinery.  Forced  overexpressioii  of  M.4Z  dramatically 
downregulates  c-myb  promoter  activity  (Figure  8), 
indicating  that  M,4Z  is  a  transcriptional  repressor  of  the 
c-myb  promoter.  However,  these  data  do  not  show 
whether  repression  of  c-myb  expression  is  due  to  MAZ 
binding  to  duplex  DNA  or  to  the  T:H:H:T  structure. 

The  c-myb  GGA  repeat  region  is  very  complex, 
involving  DNA  conformational  changes  from  double- 
stranded  to  G-quadruplex  forms  that  subsequently  alter 
transcription  factor  binding  to  the  region.  In  spite  of  this 
complexity,  we  have  shown  here  the  importance  of  the 
T:H:H:T  G-quadruple.x-forming  region  of  the  c-myb 
promoter  in  controlling  promoter  activity,  identified 
MAZ  as  a  T:H:H:T  binding  transcription  factor,  and 
pre.sented  preliminary  evidence  that  MAZ  is  a  repressor  of 
the  c-myb  promoter.  Further  study  is  needed  to  discrimi¬ 
nate  DNA  conformation  dependent  roles  for  MAZ  in 
regulation  of  the  c-myb  promoter,  to  identify  the  coFictors 
that  bind  to  MAZ  depending  on  DNA  structure,  and  to 
elucidate  what  determines  M.XZ  binding  to  either 
G-quadruplex  or  double-stranded  DNA. 
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